Integrating a Structured-Text Retrieval System with an Object-Oriented Database System
نویسندگان
چکیده
We describe the integration of a structured-text retrieval system (TextMachine) into an object-oriented database system (OpenODB). Our approach is a light-weight one, using the external function capability of the database system to encapsulate the text retrieval system as an external information source. Yet, we are able to provide a tight integration in the query language and processing; the user can access the text retrieval system using a standard database query language. The e cient and e ective retrieval of structured text performed by the text retrieval system is seamlessly combined with the rich modeling and general-purpose querying capabilities of the database system, resulting in an integrated system with querying power beyond those of the underlying systems. The integrated system also provides uniform access to textual data in the text retrieval system and structured data in the database system, thereby achieving information fusion. We discuss the design and implementation of our prototype system, and address issues such as the proper framework for external integration, the modeling of complex categorization and structure hierarchies of documents (under automatic document schema importation), and techniques to reduce the performance overhead of accessing an external source.
منابع مشابه
Prototype for Integrating Probabilistic Fact and Text Retrieval
We describe a prototype for an information system that integrates text and fact retrieval. A query is a set of conditions which relate either to the text or the attribute values of a database object. Conditions may be assigned weights w.r.t. the query as well as to an object. These weights form the basis for a ranking of the database objects w.r.t. the query. As user interface, the system provi...
متن کاملIntegrating Diverse Information Management Systems: A Brief Survey
Most current information management systems can be classified into text retrieval systems, relational/object database systems, or semistructured/XML database systems. However, in practice, many applications data sets involve a combination of free text, structured data, and semistructured data. Hence, integration of different types of information management systems has been, and continues to be,...
متن کاملIntegrating INQUERY with an RDBMS to Support Text Retrieval
Information is a combination of structured data and unstructured data. Traditionally, relational database management systems (RDBMS) have been designed to handle structured data. IR systems can handle text (unstructured data) very well but are not designed to handle structured data. With present day information being a combination of structured and unstructured data, there is an increasing dema...
متن کاملEvaluation of object-relational database systems for fulltext retrieval
Object-relational database systems add object-oriented features to relational DBMS and allow the DBMS’s functionality to be extended to new application domains. For the important domain of fulltext retrieval and document management, we analyze whether current object-relational DBMS are already able to compete with specialized information retrieval (IR) systems. After discussing the main require...
متن کاملGeneral Database Infrastructure for Image Retrieval
In this article, a general database infrastructure implemented in an Object-Relational Database Management System for image retrieval is proposed and describe. Semantic and content based image queries can be performed with this application. The infrastructure is structured into three levels: content-based, semantic data and an interface integrating them. It is complemented with a set of databas...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1994